GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 20, 2003, 17:20:04 ; Search time 34 Seconds 

(without alignments) 
30.359 Million cell updates/sec 

Title: US-09-807-980-8 
Perfect score: 24 
Sequence: 1 GGGF 4 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 830525 seqs, 258052604 residues 

Total number of hits satisfying chosen parameters: 830525 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL_23 : * 

1: sp_archea:* 

2: sp_bacteria : * 

3 : sp_f ungi : * 

4: sp_human:* 

5: sp_invertebrate : * 

6: sp_mammal:*. 

7: sp_mhc:* 

8: sp_organelle : * 

9: sp_phage:* 

10: sp_plant:* 

11: sp_rodent:* 

12: sp_virus:* 

13: sp_vertebrate : * 

14: sp_unclassif ied: * 

15: sp_rvirus:* 

16: sp_bacteriap : * 

17: sp_archeap:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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24 
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089823 budgerigar 


14 


A A 

2 4 


1 A A 
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089830 budgerigar 
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24 
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38 
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089833 


089833 budgerigar 
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24 
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. 0 


38 


12 


089831 


089831 budgerigar 
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023471 arabidopsis 
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Q63281 rattus norv 


2 4 


O A 
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0 


47 


10 


Q9S857 


Q9s857 glycine max 


•3 n 


A /I 

24 


"1 A A 

100 . 


0 


47 


16 


Q9K9F6 
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24 
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Q9gmn4 macaca fasc 


40 


24 
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41 


24 


100. 
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61 
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Q8KK56 


Q8kk56 proteus vul 


42 


24 


100. 
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61 


16 


Q8XQD3 


Q8xqd3 ralstonia s 


43 


24 


100. 
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Q25250 lucilia cup 


44 


24 


100. 
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Q25245 


Q25245 lucilia cup 


45 


24 


100. 


0 


64 
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Q25251 


Q25251 lucilia cup 



ALIGNMENTS 



RESULT 1 
005601 

ID 005601 PRELIMINARY; PRT ; 19 AA. 

AC 005601; 

DT 01-JUL-1997 (TrEMBLrel. 04, Created) 

DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last annotation update) 



DE Transposon Tn5041 DNA (Fragment) . 

OS Pseudomonas sp . 

OC Bacteria; Proteobacteria . 

OX NCBI_TaxID=306; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=KHP41; TRANS POSON=Tn5 04 1 ; 

RX MEDLINE=97419493; PubMed=9274008 ; 

RA Khloldii G.Y., Yurieva O.V., Gorlenko Z.M., Mindlin S.Z., Bass I. A., 

RA Lomovskaya O.L., Kopteva A. V. , Nikif orov V.G.; 

RT "Tn5041 : a chimeric mercury resistance transposon closely related t 

RT the toluene degradative transposon Tn4651."; 

RL Microbiology 143:2549-2556(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=KHP41; TRANSPOSON=Tn504 1 ; 

RA Kholodii G.Y., Mindlin S.Z., Gorlenko Z.M., Bass I.A.,, Kalyaeva E.S. 

RA Nikiforov V. ; 

RT "Host-dependent transposition of Tn5041."; 

RL Russ. J. Genet. -36:365-373(2000) . 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=KHP41; TRANS POSON=Tn5 04 1 ; 

RA Kholodii G. ; 

RL Submitted (SEP-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; X98999; CAA67457.1; -. 

FT NON_TER 1 ■ 1 

FT NON_TER - 19 19 

SQ SEQUENCE 19 AA; 1797 MW; 2070A562EF85EDD9 CRC64 ; 



Query Match * 100.0%; Score 24; DB 2; Length 19; 

Best Local Similarity 100.0%; Pred. No. 3.8e+02; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 GGGF 4 

I I I I 

Db 3 GGGF 6 



Search completed: October 20, 2003, 17:24:31 
Job time : 38 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: October 20, 2003, 17:19:34 ; Search time 11 Seconds 

(without alignments) 
17.101 Million cell updates/sec 



Title: US-09-807-980-8 
Perfect score: 24 
Sequence: 1 GGGF 4 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 127863 seqs, 47026705 residues 



Total number of hits satisfying chosen parameters: 127863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt_4 1 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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9 


24 


100. 
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1 
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pyrococcus 
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ALIGNMENTS 



RESULT 1 
FBRL_RAT 

ID FBRL_RAT STANDARD; PRT; 31 AA. 

AC P22509; 

DT 01-AUG-1991 (Rel. 19, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Fibrillarin (Nucleolar protein 1) (Fragment) . 

GN FBL . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; , Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=86033920; PubMed=24 142 94 ; 

RA Lischwe M.A., Ochs R.L., Reddy R. , Cook R.G., Yeoman L.C., Tan E.M., 

RA Reichlin M., Busch H.; 

RT "Purification and partial characterization of a nucleolar scleroderma 

RT antigen (Mr = 34,000; pi, 8.5) rich in NG, NG-dimethylarginine . " ; 

RL -J. Biol. Chem. 260:14304-14310(1985). 

RN [2] 

RP SEQUENCE OF 1-28, AND INTERACTION WITH NOLC1. 

RX MEDLINE=20143579; PubMed=10679015 ; 



RA Yang Y., Isaac C, Wang C, Dragon F. , Pogacic V., Meier U.T.; 

RT "Conserved composition of mammalian box H/ACA and box C/D small 

RT nucleolar ribonucleoprotein particles and their interaction with the 

RT common factor Noppl40."; 

RL Mol. Biol. Cell 11:567-577(2000). 

CC -!- FUNCTION: FIBRILLARIN IS A COMPONENT OF A NUCLEOLAR SMALL NUCLEAR 
CC RIBONUCLEOPROTEIN PARTICLE THOUGHT TO PARTICIPATE IN THE FIRST 

CC STEP IN PROCESSING PRERIBOSOMAL RNA. IT IS ASSOCIATED WITH THE 

CC U3, U8 AND U13 SMALL NUCLEAR RNAS . 

CC -!- SUBUNIT: Interacts with Nolcl/Noppl4 0 . 

CC -!- SUBCELLULAR LOCATION: Nuclear; fibrillar region of the nucleolus. 

CC -!- SIMILARITY: BELONGS TO THE FIBRILLARIN FAMILY. 

DR PIR; A23887; A23887. 

DR InterPro; IPR000692; Fibrillarin. 



DR 


PROSITE; 


PS00566; 


FIBRILLARIN; PARTIAL. 


KW 


Nuclear 


protein; 


Methylation 


; Ribonucleoprotein; rRNA 


KW 


RNA-binding. 






FT 


MOD_RES 


8 


8 


METHYLATION (DI-) . 


FT 


MOD RES 


15 


15 


METHYLATION (DI-) . 


FT 


MOD RES 


21 


21 


METHYLATION (DI-) . 


FT 


MOD RES 


24 


24 


METHYLATION (DI-) . 


FT 


MOD RES 


28 


28 


METHYLATION (DI-) . 


FT 


MOD_RES 


31 


31 


METHYLATION (DI-) . 


FT 


CONFLICT 


2 


2 


K -> D (IN REF. 2) . 


FT 


NON TER 


31 


31 




SQ 


SEQUENCE 


31 AA; 


2965 MW; 


93EBCC102847A363 CRC64; 



Query Match 100.0%; Score 24; DB 1; Length 31; 

Best Local Similarity 100.0%; Pred. No. l.le+02; 
Matches 4; Conservative 0; Mismatches 0; Indels 



0; Gaps 



Qy 

Db 



1 GGGF 4 

I I I I 
9 GGGF 12 



Search completed: October 20, 2003, 17:23:49 
Job time : 13 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 



October 20, 2003, 17:22:04 ; Search time 15 Seconds 

(without alignments) 
25.645 Million cell updates/sec 

US-09-807-980-8 
24 

1 GGGF 4 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283308 



Database 



PIRJ76:* 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


24 


100.0 


10 


2 


E41946 


T-cell receptor ga 


2 


24 


100.0 


16 


2 


PH1453 


T-cell receptor al 


3 


24 


100.0 


29 


2 


A83923 


hypothetical prote 


4 


24 


100. 0 


31 


2 


A23887 


fibrillarin - rat 


5 


24 


100.0 


31 


2 


A21919 


tricarboxylate-bin 


6 


24 


100.0 


35 


2 


S18264 


(S ) -6-hydroxynicot 


7 


24 


100.0 


42 


2 


A71429 


hypothetical prote 


8 


24 


100.0 


43 


2 


F83732 


hypothetical prote 


9 


24 


100.0 


' 44 


2 


AG2 826 


hypothetical prote 


10 


24 


100.0 


47 


2 


C83986 


hypothetical prote 


11 


24 


100.0 


57 


2 


T22546 


hypothetical prote 


12 


24 


100. 0 


60' 


2 
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GPI-anchored epidi 


13 


24 
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14 


24 


100 . 0 


c c. 
DD 


2 


t_t O n O A A 

H81804 


hypothetical prote 


15 


A A 

Zh 


i a a a 
100 . U 


ob 
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small acid-soluble 


17 


24 


100.0 


70 


2 


T42189 


f atty-acid-CoA lig 


18 


24 


100.0 


/ 0 


2 


JS0433 


F-plasmid maintena 


19 


24 


100.0 


73 


2 


JS0107 


hypothetical 7.4K 


20 


24 


100 . 0 


74 


2 


T22077 


hypothetical prote 


21 


24 


100 . 0 


74 


2 


T33087 


hypothetical prote 


22 


24 


100 . 0 


7 6 


2 


T32666 


hypothetical prote 


23 


24 


100 . 0 


78 


2 


E81116 


probable lipoprote 


24 


24 


100.0 


82 


2 


/~* f A A A A 

C64940 


transglycosylase-a 


25 


24 


100 . 0 


82 


2 


n rto 

D85790 


transglycosylase-a 


26 


24 


100 . 0 


82 


2 


AG0713 


probable membrane 


27 


24 


100 . 0 


82 


2 


H90941 


transglycosylase-a 


28 


24 


100.0 


84 


2 


H64865 


transglycosylase-a 


29 


24 


100 . 0 


84 


2 


D85698 


hypothetical prote 


30 


24 


100 . 0 


84 


2 


T*v A A O >1 A 

D9 0840 


hypothetical prote 


31 


24 


100 . 0 


84 


2 


AF0722 


probable membrane 


32 


24 


100 . 0 


84 


2 


S33999 


EP84R protein - Af 


33 


24 


100 . 0 


84 


2 


AC3279 


hypothetical prote 


34 


24 


100 . 0 


85 


2 


T32664 


hypothetical prote 


35 


24 


100 . 0 


85. 


2 


F64529 


hypothetical prote 


36 


24 


100 . 0 


85 


2 


A42056 


ribosomal protein 


37 


24 


100 . 0 


86 


2 


C71960 


hypothetical prote 


38 


24 


100 . 0 


86 


2 


F86294 


hypothetical prote 


39 


24 


100 . 0 


87 


2 


T26867 


hypothetical prote 


4 0 


Zh 


i nn a 

100 . u 


O A 

o y 


1 


XVfcjBBD 


Shigella toxin cha 


41 


24 


100.0 


89 


1 


JN0726 


Shiga-like toxin I 


42 


24 


100.0 


89 


1 


XVBPH9 


Shiga-like toxin c 


43 


24 


100.0 


89 


2 


B53887 


Shiga-like toxin I 


44 


24 


100.0 


89 


. 2 


E91000 


Shiga toxin I subu 


45 


24 


100.0 


89 


2 


G85845 


Shiga toxin I subu 



ALIGNMENTS 



RESULT 1 
E41946 

T-cell receptor gamma chain (la. 9) - mouse (fragment) 
C; Species: Mus musculus (house mouse) 

C;Date: 03-Feb-1994 #sequence_revision 03-Feb-1994 #text_change 07-May-1999 
C;Accession: E41946 

R;Whetsell, M. ; Mosley, R.L.; Whetsell, L. ; Schaefer, F.V. ; Miller, K.S.; Klei 
J.R. ' 
Mol. Cell. Biol. 11, 5902-5909, 1991 

A;Title: Rearrangement and junctional-site sequence analyses of T-cell recepto 
gamma genes in intestinal intraepithelial lymphocytes from murine athymic 
chimeras . 

A;Reference number: A41946; MUID : 92 04 9316; PMID: 1658619 
A; Accession: E41946 

A; Status : preliminary; not compared with conceptual translation 

A; Molecule type: DNA 

A; Residues: 1-10 <WHE> 

C; Keywords: T-cell receptor 



Query Match 



100.0%; Score 24; DB 2; Length 10; 



Best Local Similarity 100.0%; Pred. No. 77; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGGF 4 

I I I I 

Db 7 GGGF 10 

Search completed: October 20, 2003, 17:24:52 
Job time : 17 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 

Perfect score: 
Sequence : 



October 20, 2003, 17:32:30 ; Search time 95 Seconds 

(without alignments) 
10.865 Million cell updates/sec 

US-09-807-980-8 
24 

1 GGGF 4 



Scoring table: 



Searched: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 

830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 8 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



452 



Database 



SPTREMBL 23:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 



sp_archea: * 
sp_bacteria: * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate : * 
sp_mammal : * 
sp_mhc: * 
sp_organelle : * 
sp_phage : * 

sp_plant:* 

sp_rodent : * 

sp_virus : * 

sp__vertebrate : * 

sp_unclassif ied: * 

sp_rvirus : * 

sp_bacteriap : * 

sp_archeap: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 



1 


12 


50 . 0 


7 


11 


Q63480 


Q63480 rattus nory 


2 


12 


50 . 0 


7 


11 


055184 


055184 rattus norv 


3 


12 


50. 0 


8 


4 


Q15901 


Q15901 homo sapien 


4 


12 


50. 0 


8 


11 


Q9QVI5 


Q9qvi5 rattus sp. 


5 


12 


50. 0 


8 


11 


Q9R1U6 


Q9rlu6 mus musculu 


6 


11 


45 . 8 


7 


10 


049223 


049223 glycine max 


7 


11 


45.8- 


8 


6 


Q9TT78 


Q9tt78 canis famil 


8 . 


11 


45 . 8 


8 


8 


P92386 


P92386 hordeum mar 


9 


10 


41.7 


8 


5 


TV CI A /" A O 

P82688 


P82688 periplaneta 


10 


9 


i / . o 


8 


2 


Q9R5R2 


Q9r5r2 shigella dy 


11 


9 


37 . 5 


8 


7 


Q95213 


Q95213 oryctolagus 


12 


8 


33 . 3 


8 


8 


P93963 


P93963 psathyrosta 


13 


8 


33 . 3 


8 


8 


P92384 


P92384 hordeum mur 


14 


8 


33 . 3 


8 


8 


P93973 


P93973 eremopyrum 


15 


8 


33 . 3 


8 


8 


P92215 


P92215 amblyopyrum 


16 


8 


33 . 3 


8 


8 


P93957 


P93957 festucopsis 


17 


8 


33 . 3 


8 


8 


P92222 


P92222 bromus iner 


18 


8 


33 . 3 


8 


8 


P92388 


P92388 henrardia p 


19 


8 


33 . 3 


8 


8 


P92441 


P92441 thinopyrum 


20 


8 


33 . 3 


8 


8 


P92404 


P92404 lophopyrum 


21 


8 


33 . 3 


8 


8 


P93961 


P93961 psathyrosta 


22 


8 


33 . 3 


8 


8 


P93970 


P93970 eremopyrum 


23 


8 


33 . 3 


8 


8 


P93955 


P93955 festucopsis 


24 


8 


33. 3 


8 


8 


P93965 


P93965 secale stri 


25 


8 


33 . 3 


8 


8 


P92394 


P92394 hordeum vul 


2 6 


8 


33 . 3 


8 


8 


P92382 


P92382 hordeum bra 


27 


8 


33 . 3 


8 


8 


P93966 


P93966 aegilops sp 


28 


8 


33 . 3 


8 


8 


Q94VA0 


Q94va0 varanus sem 


29 


8 


33 . 3 


8 


8 


P92227 


P92227 crithopsis 


30 


8 


33. 3 


8 


8 


P92373 


P92373 haynaldia v 


31 


8 


33. 3 


8 


8 


P92211 


P92211 agropyron c 


32 


8 


33 . 3 


8 


8 


P92428 


P92428 peridictyon 


33 


8 


33 . 3 


8 


8 


P93959 


P93959 hordeum ere 


Q A 


o 
o 




0 


8 


P922 19 


P92219 australopyr 


3d 


8 


33 . 3 


8 


8 


P93985 


P93985 aegilops co 


36 


8 


33 . 3 


8 


8 


P92443 


P92443 taeniatheru 


o / 


o 
O 


33.3 


8 


8 


P92391 


P92391 heteranthel 


38 


8 


33 . 3 


8 


8 


P93981 


P93981 crithodium 


39 


8 


33 . 3 


■ 8 


8 


P93992 


P93992 australopyr 


40 


8 


33 . 3 


8 


8 


P92426 


P92426 pseudoroegn 


41 


8 


33.3 


8 


8 


P92431 


P92431 aegilops ta 


42 


8 


33.3 


8 


8 


P92422 


P92422 psathyrosta 


43 


8 


33.3 


8 


13 


Q902V5 


Q90zv5 fulica leuc 


44 


8 


33.3 


8 


13 


P82079 


P82079 limnodynast 


45 


7 


29.2 


7 


10 


P82445 


P82445 nicotiana t 



ALIGNMENTS 



RESULT 1 
Q63480 

ID Q63480 PRELIMINARY; PRT; 7 AA. 

AC Q63480; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-DEC-2001 (TrEMBLrel. *19, Last annotation update) 



DE TR4-NS orphan receptor (Fragment) . 

GN TR4 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96198747; PubMed=8 6124 8 6 ; 

RA Yoshikawa T., Makino S., Gao X.M., Xing G.Q, , Chuang D.M., 

RA Detera-Wadleigh S.D.; 

RT "Splice variants of rat TR4 orphan receptor: differential expression 

RT of novel sequences in the 5 1 -untranslated region and C-terminal 

RT domain. "; 

RL Endocrinology 137:1562-1571(1996). 

DR EMBL; U59125; AAB02827.1; 

KW Receptor. 

FT NON_TER 1 1 

SQ SEQUENCE 7 AA; 758 MW; 672AA87864005350 CRC64; 

Query Match 50.0%; Score 12; DB 11; Length 7; 

Best Local Similarity 100.0%; Pred. No. 8.3e+05; 

Matches 2; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 GG 2 

I I ' 

Db 4 GG 5 



Search completed: October 20, 2003, 17:37:47 
Job time : 98 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



October 20, 2003, 17:25:34 ; Search time 23 Seconds 

(without alignments) 
8.179 Million cell updates/sec 

US-09-807-980-8 
24 

1 GGGF 4 
BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 8 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



158 



Database 



SwissProt 41:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


-18 


75. 


0 


8 


1 


LCK5 LEUMA 


P19987. leucophaea 


2 


15 


62. 


5 


8 


1 


UF06 MOUSE 


P38644 mus musculu 


3 


13 


54. 


2 


7 


1 


FAR5 V HIRME 


P42564 hirudo medi 


4 


13 


54. 


2 


8, 


1 


AL16_CARMA 


P81819 carcinus ma 


5 


13 


54. 


2 


8 


1 


ALL9 CARMA 


P81812 carcinus ma 


6 


12 


50. 


0 


4 


1 


ACH1 ACHFU 


P35904 achatina fu 


7 


12 


50. 


0 


4 


1 


OCP1 OCTMI 


P58648 octopus min 


8 


12 


50. 


0 


5 


1 


PAP2_PARMA 


P81864 pardachirus 


9 


12 


50. 


0 


5 


1 


RE32 LITRU 


P82073 litoria rub 


10 


12 


50. 


0 


6 


1 


FARP MONEX 


P41966 moniezia ex 


11 


12 


50. 


0 


8 


1 


LCK2_LEUMA 


P21141 leucophaea 


12 


12 


50. 


0 


8 


1 


LCK3_LEUMA 


P21142 leucophaea 


13 


12 


50. 


0 


8 


1 


ORMY_ORCLI 


P82455 orconectes 


14 


12 


50. 


0 


8 


1 


VGLG_HSV2B 


P81780 herpes simp 


15 


12 


50. 


0 


8 


1 


WP1_PERAT 


P83195 perkinsus a 


16 


11 


45. 


8 


8 


1 


LCK8_LEUMA 


P19990 leucophaea 


17 


11 


45. 


8 


8 


1 


LMT2_LOCMI 


P22396 locusta mig 



18 


9 


37 


5 


7 


1 


FAR1 HELTI 


P41871 


helisoma tr 


19 


9 


37 


5 


8 


1 


ALL5 CYDPO 


P82156 


cydia portion 


20 


8 


33 


3 


o 
0 


1 


FAR7 ASCSU 


P43171 


ascaris suu 


21 


7 


29 


2 


4 


1 


OCP3 OCTMI 


P58649 


octopus min 


22 


7 


29 


2 


8 


1 


AKHG GRYBI 


P14086 


gryllus bim 


23 


7 


29 


2 


8 


1 


AKH TABAT 


P14595 


tabanus atr 


24 


7 


29 


2 


8 


1 


AL15 CARMA 


P81818 


carcinus ma 


25 


7 


29 


2 


8 


1 


Tt T "1 ^1 Tl Y*Mi r "7\ 

AL17 CARMA 


P81820 


carcinus ma 


26 


7 


29 


2 


8 


1 


ALL 8 CARMA 


P81811 


carcinus ma 


27 


7 


29 


2 


o 
8 


1 


CCKN MACEU 


P30369 


macropus eu 


28 


7 


29 


2 


8 


1 


RPCH PANBO 


P08939 


pancialus bo 


2y 


b 


Z D 


U 


Q 


1 


GRWM HUMAN 


P01157 


homo sapien 


30 


6 


25 


0 


A 

4 


1 
1 


DCML_PSECH 


P19916 


pseudomonas 


31 


b 


2 D 


U 


A 

4 


1 


P/^pT TTT T"K JT7V XT 

EOS I HUMAN 


t"» r\ o t o i 

P02731 


homo sapien 


32 


6 


25 


0 


4 


1 


FAR3 HIRME 


P42562 


hirudo medi 


33 


b 


25 


0 


4 


1 


FAR 4 HIRME 


P42563 


hirudo medi 


34 


6 


25 


0 


4 


1 


FFKA ANTEL 


P58705 


anthopleura 


35 


6 


25 


0 


4 


1 


FLRF HIRME 


P42561 


hirudo medi 


36 


6 


25 


0 


4 


1 


FLRN ANTEL 


P58707 


anthopleura 


37 


6 


25 , 


0 


4 


1 


FMRF MACNI 


P01162 


macrocallis 


38 


6 


25 . 


0 


4 


1 


FYRI ANTEL 


P58706 


anthopleura 


39 


6 


25 , 


0 


5 


1 


AL 1 4 CARMA 


P81817 


carcinus ma 


40 


6 


25 . 


0 


5 


1 


EI03_LITRU 


P82099 


litoria rub 


41 


6 


25. 


0 


5 


1 


EI04 LITRU 


P82100 


litoria rub 


42 


6 


25, 


0 


5 


1 


FARP ARTTR 


P41853 


artioposthi 


43 


6 


25, 


0 


5 


1 


RE11 LITRU 


P82070 


litoria rub 


44 


6 


25, 


0 


5 


1 


RE21_LITRU 


P82071 


litoria rub 


45 


6 


25, 


0 


5 


1 


RE31 LITRU 


P82072 


litoria rub 



ALIGNMENTS 



RESULT 1 
LCK5_LEUMA 

ID LCK5_LEUMA STANDARD; PRT; 8 AA. 

AC P19987; 

DT 01-FEB-1991 (Rel. 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 30-MAY-2000 (Rel. 39, Last annotation update) 

DE Leucokinin V (L-V) . 

OS Leucophaea maderae (Madeira cockroach) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Orthopteroidea ; Dictyoptera; Blattaria; Blaberoidea; 

OC Blaberidae; Leucophaea.. 

OX NCBI_TaxID=698 8; 

RN [1] 

RP SEQUENCE. 

RC TISSUE=Head; 

RX MEDLINE=87052651; PubMed=2 877794 ; 

RA Holman G.M., Cook'B.J., Nachman R.J.; 

RT "Isolation, primary structure, and synthesis of leucokinins V and VI: 

RT myotropic peptides of Leucophaea maderae."; 

RL Comp. Biochem. Physiol. 88C: 27-30 (1987 ) . 

CC -!- FUNCTION: THIS CEPHALOMYOTROPIC PEPTIDE STIMULATES CONTRACTILE 
CC ACTIVITY OF COCKROACH PROTODEUM (HINDGUT) . 

CC -!- SIMILARITY: TO THE OTHER LEUCOKININS. 



DR PIR; JS0315; JS0315. 

KW Neuropeptide; Amidation. 

FT . MOD_RES 8 8 AMIDATION. 

SQ SEQUENCE 8 AA; 784 MW; 736365A5B9C865B8 CRC64; 



Query Match 75.0%; Score 18; DB 1; Length 8; 

Best Local Similarity 75.0%; Pred. No. 1.3e+05; 

Matches 3; Conservative 0; Mismatches 1; Indels 

Qy 1 GGGF 4 

I I I 

Db 1 GSGF 4 



Search completed: October 20, 2003, 17:36:02 
Job time : 23 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



October 20, 2003, 17:32:55 ; Search time 39 Seconds 

(without alignments) 
9.863 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-807-980-8 
24 

1 GGGF 4 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



283308 seqs, 96168682 residues 



Total number of hits satisfying chosen parameters: 



604 



Minimum DB seq length: 0 
Maximum DB seq length: 8 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



PIRJ76: * 
pirl : * 
pir2:* 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


18 


75.0 


5 


2 


B61445 


Leu-enkephalin - b 


2 


18 


75.0 


5 


2 


A61445 


Met-enkephalin - b 


3 


18 


75.0 


5 


2 ' 


B61168 


cocoonase (EC 3.4. 


4 


18 


75.0 


5 


2 


C53284 


T-cell receptor be 


5 


18 


75.0 


6 


2 


A41946 


T-cell receptor ga 


6 


18 


75.0 


7 


2 


A60224 


Met-enkephalin-Arg 


7 . 


18 


75.0 


. 7 


2 


E61491 


seed protein ws-5 


8 


18 


75.0 


8 


2 


JS0315 


leucokinin V - Mad 


9 


18 


75.0 


8 


2 


PT0509 


T-cell receptor be 


10 


15 


62.5 


8 


2 


PC4373 


telomeric and tetr 


11 


12 


50. 0 


3 


3 


A23751 


spinal cord peptid 


12 


12 


50.0 


4 


2 


A32480 


achatin-I - giant 


13 


12 


50.0 


4 


2 


A53284 


T-cell receptor be 



1 A 
X 4 


1 9 
XZ 


cn a 


A 

4 


o 
z 


Booz o 4 


T-cell receptor be 


X 0 


1 9 
XZ 


cn A 


A 
4 


n 
Z 


rlU / Uu 


T-cell receptor be 


1 £ 
1 0 


1 9 
XZ 


cn n 

DU . U 


A 
4 


Z 


o4 /OOZ 


ubiquitin - rat 


X 1 


1 9 
XZ 


cn A 

OU. U 


C 

0 


Z 


r* o o t c i 
Cz3 /Ol 


spinal cord peptic! 


T Q 

lo 


n 9 
x*: 


C. A a 

OU . U 


c 
0 


z 


to n o c o 

JHuzo3 


gut pentapepticie - 


1 Q 

xy 


lz 


cn n 
OU . U 


c 
0 


2 


PT026 7 


Ig heavy chain CRD 


9 a 
Z u 


1 9 

XZ 


, cn a 


c 
0 


Z 


D4 4 oz 3 


synaptosomal-assoc 


9 1 

z X 


1 9 
XZ 


cn n 
OU . U 


c 
0 


z 


n m n ^ c c\ 

PTOooy 


T-cell receptor be 


9 9 

z z 


1 9 

XZ 


c a a 


c 
0 


2 


nm n T n T 

PT0707 


T-cell receptor be 


9 ^ 
Z 0 


1 9 

xz 


a a 


c 
0 




nmn C O C 

r I Uooo 


T-cell receptor be 


Z 4 


1 9 

xz 


a a 


c 
0 


o 
Z 


r I U / X / 


T-cell receptor be 


z o 


1 9 
XZ 


c. a n 
0 U . U 


o 


Z 


JU UoOO 


lipopeptide WS1279 


z o 


1 9 
XZ 


c a a 

DU . U 


D 


z 


cn A O O A 

F 1 UZ o U 


Ig heavy chain CRD 


9 T 
Z / 


1 9 
XZ 


c a n 
0 u . u 


c 
D 


o 
Z 


nm^CI A 

PTUol4 


T-cell receptor be 


9 P 
Z 0 


1 9 
XZ 


a a 
0 U . U 


/r 
O 


Z 


nm n c I o 
PTU0l2 


T-cell receptor be 


9 Q 

z y 


1 9 

xz 


a a 
OU . U 


D 


Z 


n m n n c 

PTUdOo 


T-cell receptor be 




1 9 
XZ 


^ a a 
DU . U 


c 
D 


Z 


nm A T O A 

P 1 u /z u 


T-cell receptor be 


o J. 


1 9 
XZ 


ou . u 


O 


Z 


"DTi A C C A 

rlUObU 


T-cell receptor be 


"39 


1 9 
XZ 


OU . U 


D 


Z 


PTU /zo 


T-cell receptor be 


JJ 


XZ 


C A A 

OU. U 


O 


2 


PT0727 


T-cell receptor be 


o 4 


1 9 

xz 


£A A 

OU . u 


/- 
D 


z 


nm n T t n 

PTO 73 0 


T-cell receptor be 


o O 


1 9 


Ci A A 
D U . U 


c 
D 


z 


A43 xz y 


neuropeptide GNFFR 


O D 


1 9 
X Z 


c. A A 
0 U . U 


/ 


z 


b4Z4U / 


gramicidin S synth 


^7 
■j / 


1 9 
XZ 


c; A A 
0 U . U 


1 


n 

z 


0 0 0 n n 0 


180K exoantigen - 


o o 


1 9 
XZ 


cn A 
OU . U 


1 


z 


PTUoz 3 


T-cell receptor be 




1 9 
XZ 


A A 
OU . U 


1 


n 

z 


PTUd4z 


T-cell receptor be 


40 


12 


50. 0 


7 


2 




T— cell receptor be 


41 


12 


50.0 


7 


2 


PT0666 


T-cell receptor be 


42 


12 


50.0 


7 


2 


PT0663 


T-cell receptor be 


43 


12 


50.0 


7 


2 


PT0542 


T-cell receptor be 


44 


12 


50.0 


7 


2 


PT0683 


T-cell receptor be 


45 


12 


50.0 


7 


2 


PT0719 


T-cell receptor be 



ALIGNMENTS 



RESULT 1 
B61445 

Leu-enkephalin - blue mussel 

C; Species: Mytilus edulis (blue mussel) 

C;Date: 07-Oct-1994 #sequence_revision 07-Oct-1994 #text_change 21-Jan-2000 

C;Accession: B61445 

R; Leung, M.K.; Stefano, G.B. 

Proc. Natl. Acad. Sci. U.S.A. 81, 955-958, 1984 

A;Title: Isolation and identification of 4 enkephalins in pedal ganglia of Mytilus 
edulis (Mollusca) . 

A; Reference number: A61445; MUID : 84 144 823 ; PMID: 6583690 

A/Accession: B61445 

A;Molecule type: protein 

A; Residues: 1-5 <LEU> 

A; Experimental source: pedal ganglia 

C; Keywords: neuropeptide; opioid peptide 

Query Match 75.0%; Score 18; DB 2; Length 5; 

Best Local Similarity 100.0%; Pred. No. 2.8e+05; 

Matches 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



2 GGF 4 

II I 
2 GGF 4 



Search completed: October 20, 2003, 17:38:30 
Job time : 39 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



October 20, 2003, 17:37:55 ; Search time 66 Seconds 

(without alignments) 
9.934 Million cell updates/sec 



Title: US-09-807-980-8 
Perfect score: 24 



Sequence : 



1 GGGF 4 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 609560 seqs, 163917102 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 8 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



41226 



Database : 



Published_Applications_AA: * 

1: /cgn2_6/ptodata/2/pubpaa/US07JPUBCOMB.pep:* 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB - pep : * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB . pep : * 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEWJ?UB . pep : * 

6 : /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB . pep : * 

7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:* 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB . pep : * 

9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: * 

10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB . pep : * 

11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB . pep : * 

12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep: * 

13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB . pep : * 

14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB . pep : * 

15 : /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB . pep : * 

16 : /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB . pep : * 

17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB . pep : * 

18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB . pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution.' 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB ID 



Description 
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33, Appl 


43 


18 


75. 


0 * 


4 


12 


US-10-133-973-73 


Sequence 
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4, Appli 



ALIGNMENTS 



RESULT 1 

US-09-866-512A-1 

; Sequence 1, Application US/09866512A 

; Publication No. US20030053954A1 

; GENERAL INFORMATION: 

; APPLICANT: Meade, Thomas J 



; TITLE OF INVENTION: Magnetic Resonance Imaging Agents for the Detection of 
Physiological 

; TITLE OF INVENTION: Agents 
; FILE REFERENCE: A-58634-7 

; CURRENT APPLICATION NUMBER: US/ 09/866, 512A 

; CURRENT FILING DATE: 2001-05-24 

; PRIOR APPLICATION NUMBER: US 60/287,619 

; PRIOR FILING DATE: 2001-05-26 

; PRIOR APPLICATION NUMBER: US 08/460,511 

; PRIOR FILING DATE: 1995-06-02 

; PRIOR APPLICATION NUMBER: US 08/486,968 

; PRIOR FILING DATE: 1995-06-07 

; PRIOR APPLICATION NUMBER: US 08/971,855 

; PRIOR FILING DATE: 1997-11-17 

; PRIOR APPLICATION NUMBER: US 09/134,072 

'; PRIOR FILING DATE: 1998-08-13 

; PRIOR APPLICATION NUMBER: US 09/866,512 

; PRIOR FILING DATE: 2001-05-24 

; • PRIOR APPLICATION NUMBER: US 09/405,046 

; PRIOR FILING DATE: 1999-09-27 

; PRIOR APPLICATION NUMBER: US 60/063,328 

; PRIOR FILING DATE: 1997-10-27 

; NUMBER OF SEQ ID NOS: 17 

; SOFTWARE: Patent In version 3.1 

; SEQ ID NO 1 

; LENGTH : 4 

TYPE: PRT 

ORGANISM: Unknown 

FEATURE: 

; OTHER INFORMATION: could be from any mammal. 
US-09-866-512A-1 

Query Match 100.0%; Score 24; DB 11; Length 4; 

Best Local Similarity 100.0%; Pred. No. 5.5e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGGF 4 

I I I I 

Db 1 GGGF 4 



RESULT 2 

US-10-301-499A-1 

; Sequence 1, Application US/10301499A 

; Publication No. US20030148932A1 

; GENERAL INFORMATION: 

; APPLICANT: Greene, Mark I. 

; APPLICANT: Murali, Mamachandran 

; APPLICANT: Berezov, Alan 

; TITLE OF INVENTION: COMPOUNDS THAT " BIND TO pl85 AND METHODS OF USING THE SAME 

; FILE REFERENCE: 4 040/1K399-US1 

; CURRENT APPLICATION NUMBER: US/10/301, 4 99A 

; CURRENT FILING DATE: 2002-11-21 

; PRIOR APPLICATION NUMBER: US 60/331,935 

; PRIOR FILING DATE: 2001-11-21 

; NUMBER OF SEQ ID NOS: 152 

; SOFTWARE: Patentln version 3.1 



; SEQ ID NO 1 

LENGTH: 4 

TYPE: PRT 
; ORGANISM: Artificial Sequence 

FEATURE: 

OTHER INFORMATION: synthetic peptide 
US-10-301-499A-1 

Query Match 100.0%; Score 24; DB 12; Length 4; 

Best Local Similarity 100.0%; Pred. No. 5.5e+05; 
Matches 4; Conservative 0; Mismatches 0; Indels 

Qy 1 GGGF 4 

I I I I 

Db 1 GGGF 4 



RESULT 3 
US-10-155-170-8 

; Sequence 8, Application US/10155170 

; Publication No. US20030171262A1 

; GENERAL INFORMATION: 

; APPLICANT: INOUE, Kazuhiro 

; APPLICANT: SUSAKI, Hiroshi 

; APPLICANT: IKEDA, Masuhiro 

; TITLE OF INVENTION: Drug Complexes 

; FILE REFERENCE: P17306 

; CURRENT APPLICATION NUMBER: US/10/155,170 

; CURRENT FILING DATE: 2002-05-28 

; PRIOR APPLICATION NUMBER: 09/147, 342B 

; PRIOR FILING DATE: 1999-03-25 

; NUMBER OF SEQ ID NOS : 9 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 8 

LENGTH: 4 

TYPE: PRT 
; ORGANISM: Artificial Sequence 
; FEATURE: 

; OTHER INFORMATION: Spacer Sequence 
US-10-155-170-8 

Query Match 100.0%;. Score 24; DB 12; Length 4; 

Best Local Similarity 100.0%; Pred. No. 5.5e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 

Qy 1 GGGF 4 

I I I I 

Db 1 GGGF 4 



Search completed: October 20, 2003, 17:47:29 
Job time : 68 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



October 20, 2003, 17:34:10 ; Search time 29 Seconds 

(without alignments) 
5.836 Million cell updates/sec 

US-09-807-980-8 
24 

1 GGGF 4 
BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 328717 seqs, 42310858 residues 

Total number of hits satisfying chosen parameters: 

Minimum. DB seq length: 0 
Maximum DB seq length: 8 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : 



61971 



Issued_Patents_AA: * 

1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2: /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/2/iaa/ 6A_COMB . pep : * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/2/iaa/PCTUS__COMB . pep : * 

6: /cgn2_6/ptodata/2/iaa/backf ilesl . pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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ID 
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4, Appli 


19 


18 


75 


0 


4 


1 


US-08-174-365A-79 


Sequence 
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9, Appli 
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US-08-346-613-6 
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6, Appli 
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1, Appli 
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Sequence 


3, Appli 


32 


18 


75 


0 


4 


1 
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36, Appl 
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Sequence 


9/ Appli 
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Sequence 


10, Appl 
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18 


75 
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Sequence 


35, Appl 


36 


18 


75 
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Sequence 


240, App 
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75 
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US-08-484-505A-1 


Sequence 


1, Appli 
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75 


0 
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Sequence 


1, Appli 


39 


18 


75 
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2 


US-08-484-631-240 


Sequence 


240, App 


40 


18 


75 
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US- 08-4 42-4 61D- 35 


Sequence 


35, Appl 


41 


18 


75 
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2 


US-08-689-018-1 


Sequence 


1, Appli 


42 ' 
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75 
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2 


US-08-689-018-3 


Sequence 


3, Appli 


43 


18 


75 
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Sequence 


16, Appl 


44 
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75 
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Sequence 
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45 
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US-08-670-207-1 


Sequence 
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ALIGNMENTS 



RESULT 1 

US-09-147-342B-8 

; Sequence 8, Application US/09147342B 

; Patent No. 6436912 

; GENERAL INFORMATION: 

; APPLICANT: INOUE, Kazuhiro 

; APPLICANT: SUSAKI, Hiroshi 

; APPLICANT: IKEDA, Masuhiro 

; TITLE OF INVENTION: Drug Complexes 

; FILE REFERENCE: P 173 06 

; CURRENT APPLICATION NUMBER: US/09/ 14 7 , 342B 

; CURRENT FILING DATE: 1999-03-25 

; PRIOR APPLICATION NUMBER: PCT/ JP97/01914 

; PRIOR FILING DATE: 1997-06-05 

; NUMBER OF SEQ ID NOS : 9 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 8 
LENGTH : 4 



; TYPE: PRT 

; ORGANISM: Artificial Sequence 
FEATURE: 

; OTHER INFORMATION: Spacer Sequence 
US-09-147-342B-8 

Query Match 100.0%; Score 24; DB 4; Length 4; 

Best Local Similarity 100.0%; Pred. No. 2.5e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 GGGF 4 

I I I I 

Db 1 GGGF 4 



RESULT 6 

US-07-969-307A-11 

; Sequence 11, Application US/07969307A 
; Patent No. 5463022 
; GENERAL INFORMATION: 

APPLICANT: INOUE, ET AL. 
; TITLE OF INVENTION: N-ACETYLCARBOXYMETHYLCHITOSAN 

; TITLE OF INVENTION: DERIVATIVES AND PROCESS FOR PREPARATION THEREOF 

; NUMBER OF SEQUENCES: 20 

CORRESPONDENCE ADDRESS : • 

ADDRESSEE: LARSON AND TAYLOR 
STREET: 727 SOUTH 23RD STREET 
; CITY: ARLINGTON 

; STATE: VA 

; COUNTRY: USA 

ZIP : 22202 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette, 3.50 inch, 720 Kb storage 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/07/969, 307A 

; FILING DATE: 16-FEB-1993 

; CLASSIFICATION: 530 

; ATTORNEY/AGENT INFORMATION: 
; NAME: GILLIS, WALTER C. 

; REGISTRATION NUMBER: 22086 

; REFERENCE/DOCKET NUMBER: IV/XX 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 703-920-7200 
; TELEFAX: 703-892-8428 

; INFORMATION FOR SEQ ID NO: 11: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 6 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

MOLECULE TYPE: peptide 

HYPOTHETICAL: NO 
; * ANTI-SENSE: NO 

FRAGMENT TYPE: N- terminal 
US-07-969-307A-11 



Query Match 100.0%; Score 24; ■ DB 1; Length 6; 

Best Local Similarity 100.0%; Pred. No. 2.5e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGGF 4 

I I I I 

Db 1 GGGF 4 



Search completed: October 20, 2003, 17:39:07 
Job time : 31 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: October 20, 2003, 17:17:59 ; Search time 42 Seconds 

(without alignments) 
15.117 Million cell updates/sec 



Title: US-09-8 07-98 0-8 

Perfect score: 24 
Sequence: 1 GGGF 4 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1107863 seqs, 158726573 residues 



Total number of hits satisfying chosen parameters: 1107863 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



■Database : A_Geneseq_19 Jun03 : * 

1 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1980 . DAT : * 

2 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA198 1 . DAT : * 

3 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1982 . DAT : * 

4 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1983 . DAT: * 
5: /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1984 .DAT: * 
6 : / SI DSl/gcgdata/geneseq/ genes eqp-embl/AAl 985 . DAT : * 
7 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1986 . DAT : * 
8 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1987 . DAT : * 
9 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA198 8 . DAT : * 
10 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1989 . DAT : * 
11 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1990 . DAT : * 
12 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1991 . DAT : * 
13 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1992 . DAT : * 
14 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1993 . DAT : * 
15 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1994 . DAT : * 
16: /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1995 . DAT: * 
17 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1996 . DAT: * 
18 : / SIDSl/gcgdata/geneseq/geneseqp-embl/AA1997 . DAT: * 
19 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1998 . DAT : * 
20 : /SIDSl/gcgdata/ geneseq/geneseqp-embl/AA1999 . DAT : * 
21 : /SIDSl/gcgdata/ geneseq/geneseqp-embl/AA2000 . DAT: * 
22 : / SIDSl/gcgdata/geneseq/ genes eqp-embl/AA2 001 . DAT : * 
23 : / SIDSl/gcgdata/geneseq/ genes eqp-embl/AA2 002 . DAT : * 
24 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2003 . DAT: * 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 



and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



Jo . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


24 


100. 


0 


4 


20 


AAY044 03 


Spacer peptide #6. 


2 


24 


100. 


0 


4 


21 


AAY91119 


Drug delivery syst 


3 


24 


100. 


0 


4 


21 


AAY49437 


Spacer peptide mol 


4 


24 


100. 


0 


4 


23 


ABG32178 


Drug complex assoc 


5 


24 


100. 


0 


4 


23 


ABB06035 


Spacer peptide #8. 


6 


24 


100. 


0 


4 


24 


ABP55660 


Target substance/p 


7 


24 


100. 


0 


5 


20 


AAY04402 


Spacer peptide #5. 


8 


24 


100. 


0 


5 


21 


AAB03528 


Drug delivery syst 


9 


24 


100. 


0 


5 


22 


AAU 03577 


ICC13 pentapeptide 


10 


24 


100. 


0 


5 


23 


ABG32175 


Drug complex assoc 


11 


24 


100. 


0 


5 


23 


ABB06032 


Spacer peptide #5. 


12 


24 


100. 


0 


6 


16 


AAR64944 


Peralkylated oligo 


13 


24 


100. 


0 


6 


16 


AAR64 94 5 


Peralkylated oligo 


14 


24 


100. 


0 


7 


22 


AAU08364 


Antibody heavy cha' 


15 


24 


100. 


0 


7 


22 


AAB82893 


Anti-human CD154 a 


16 


24 


100. 


0 


8 


19 


AAW54524 


High affinity Kb b 


17 


24 


100. 


0 


9 


16 


AAR72894 


E. coli PPIase-bet 


18 


24 


100. 


0 


9 


16 


AAR72938 


E. coli PPIase-bet 


19 


24 


100. 


0 


9 


17 


AAR99268 


Leukocyte cell-der 


20 


24 


100. 


0 


9 


23 


AAU95037 


Human novel protei 


21 


24 


100. 


0 


9 


24 


ABR17409 


Human cancer-relat 


22 


24 


100. 


0 


9 


24 


ABR17 442 


Human cancer-relat 


23 


24 


. 100. 


0 


9 


24 


ABR17463 


Human cancer-relat 


24 


24 


100. 


0 


9 


24 


ABR17597 


Human cancer-relat 


25 


24 


100. 


0 


9 


24 


ABR17822 


Human cancer-relat 


26 


24 


100. 


0 


9 


24 


ABR17994 


Human cancer-relat 


27 


24 


100. 


0 


9 


24 


ABR18 04 5 


Human cancer-relat 


28 


24 


100. 


0 


' 9 


24 


ABR18 171 


Human cancer-relat 


29 


24 


100. 


0 


9 


24 


ABR18 195 


Human cancer-relat 


30 


24 


100. 


0 


9 


24 


ABR18210 


Human cancer-relat 


31 


24 


100. 


0 


9 


24 


ABR18 393 


Human cancer-relat 


32 


24 


100. 


0 


9 


24 


ABR18 601 


Human cancer-relat 


33 


24 


100. 


0 


9 


24 


ABR18620 


Human cancer-relat 


34 


24 


100. 


0 


9 


24 


ABR18622 


Human cancer-relat 






100. 


0 




i fi 

JL D 


ZlADT 9Q4Q 


Ih . COll rrldSc alp 


36 


24 


100. 


0 


10 


16 


AAR72905 


E. coli PPIase-alp 


37 


24 


100. 


0 


10 


22 


AAG94804 


Human complementar 


38 


24 


100. 


0 


10 


22 


AAG94806 


Human complementar 


39 


24 


100. 


0 


10 


22 


AAG94808 


Human complementar 


40 


24 


100. 


0 


10" 


22 


AAG97296 


Human complementar 


41 


24 


100. 


0 


10. 


22 


AAG97298 


Human complementar 


42 


24 


100. 


0 


10 


22 


AAG97300 


Human complementar 


43 


24 


100. 


0 


10 


22 


AAG83808 


Arabidopsis thalia 


44- 


24 


100. 


0 


10 


22 


AAG83983 


Arabidopsis thalia 


45 


24 


100. 


0 


10 


22 


AAG84059 


Arabidopsis thalia 



ALIGNMENTS 



RESULT 1 
AAY04403 

ID AAY04403 standard; peptide; 4 AA. 
XX 

AC AAY04403; 
XX 

DT 25-JUN-1999 (first entry) 
XX 

DE Spacer peptide #6. 
XX 

KW Spacer; medicine complex; carboxyl-4C alkyl pullulan polyalcohol; 

KW tumour. 

XX 

OS Synthetic. 
XX 

PN JP11092405-A. 
XX 

PD 06-APR-1999. 
XX 

PF 19-SEP-1997; 97 JP-0254780 . 
XX 

PR 19-SEP-1997; 97 JP-0254780 . 
XX 

PA (DAUC ) DAI I CHI PHARM CO LTD. 

PA (DDSK-) DDS KENKYUSHO KK. 
XX 

DR WPI; 1999-283504/24. 
XX 

PT New medicine complex comprising pullulan alcohol - useful for its 

PT tumour site selectivity 

XX 

PS Disclosure; Page 6; 12pp; Japanese. 
XX 

CC The present invention describes a medicine complex comprising a 

CC carboxyl-4C alkyl pullulan polyalcohol combined with the residue of a 

CC medical compound through a spacer consisting of an amino acid or a 

CC spacer especially of 2 to 8 amino acids peptide-bonded . Also described 

CC are: (1) a carrier for medicine delivery for combining a medical 

CC compound consisting of a carboxyl-4C alkyl pullulan polyalcohol; and (2) 

CC use of a carboxyl-4C alkyl pullulan polyalcohol for the preparation of a 

CC medicine complex containing a carboxyl-4C alkyl pullulan polyalcohol 

CC combined to the residue of a medical compound optionally through a 

CC spacer. The medicine complex is useful for its tumour site selectivity. 

CC The present sequence represents an example of a spacer peptide given in 

CC the present invention. 

XX 

SQ Sequence 4 AA; 

Query Match 100.0%; Score 24; DB 20; Length 4; 
Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GGGF 4 
I I I I 

Db 1 GGGF 4 



RESULT 2 
AAY91119 

ID AAY91119 standard; peptide; 4 AA. 
XX 

AC AAY91119; 
XX 

DT 06-OCT-2000 (first entry) 
XX 

DE Drug delivery system compound N-terminal tetrapeptide #2. 
XX 

KW Drug delivery system; DDS; saccharide; peptidase; cytostatic; 

KW antiinflammatory; modified carboxy 1-4C alkyldextran polyalcohol; 

KW hydrolysate; antitumour; liver cancer. 

XX 

OS Unidentified. 
XX 

PN WO200025825-A1. 
XX 

PD ll-MAY-2000. 
XX 

PF 29-OCT-1999; 99WO- JP06016 . 
XX 

PR 30-OCT-1998; 98 JP-0310130 . 

PR 19-NOV-1998; 98 JP-0329272 . 
XX 

PA (DAUC ) DAIICHI PHARM CO LTD. 
XX 

PI Susaki H, Inoue K, Kuga H, Ikeda M, Shiose Y, Korenaga H; 

XX ' , 

DR WPI; 2000-365409/31. 

XX 

PT New drug delivery system compounds comprise saccharide compound 

PT modified carboxy alkyldextran polyalcohol bonded to antitumor or 

PT antiinflammatory agent 
XX 

PS Claim 33; Page 52; 64pp; Japanese. 
XX 

CC The present invention describes a drug delivery system (DDS) compound 

CC comprising a saccharide compound modified carboxy 1-4C alkyldextran 

CC polyalcohol bonded to a drug compound. Also described is a method of 

CC assaying DDS compounds with a drug bonded to a polymer carrier via a 

CC spacer containing 2-8 amino acids, comprising assaying a hydrolysate 

CC obtained by treating the DDS compound with a peptidase. The compound 

CC is used as a drug delivery system for administering e.g. antitumour 

CC and antiinflammatory drugs, especially for treating liver cancer. The 

CC assay can be used to monitor distribution and blood levels of the drug 

CC to allow accurate dosing. The carrier increases bioavailability of drug 

CC and allows the drug levels in the body to be readily assayed. The 

CC present sequence represents a specifically claimed N-terminal 

CC tetrapeptide which is used in the exemplification of the present 

CC invention. 
XX 

SQ Sequence 4 AA; 

Query Match 100.0%; Score 24; DB 21; Length 4; 
Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 1 GGGF 4 

I I I I 

Db 1 GGGF 4 



RESULT 3 


AAY49437 


ID 


AAY4y4o/ standard; peptide; 4 AA. 


XX 




AC 


AAY4 y 4 o / ; 


XX 




DT 


1/-MAR-^UUU (tirst entry) 


XX 




DE 


Spacer peptide molecule in a drug composite. 


XX 




TV-T.T 

KW 


Drug composite; spacer; drug delivery system; antitumor; 


KW 


antiinflammatory. 


XX 




OS 


Synthetic. 


XX 




irN 


WO 3 y D 1 U D 1 - Al . 


XX 






uz-utjC- iy y y . 


XX 




DT? 

r r 


zi jyiAi xyyy; yywu-JrUzbol . 


XX 




DD 
z t\ 


9 9-MAY-1 QQfl • Qfl TD— 0 1 A 0 Q 1 ^ 


YY 




PA 


/HATTr* ^ nATTPHT PH^PM fTi T TPi 
\ latiu ^ y unliuni xrrLrt.r\i v l ^ w til u . 


YY 
AA 




PT 


ou.oa.jvi n, inoue r\ r ]\uga r\, 


YY 

AA 






WPT • 9000 — 079 ^0 /Ofi 


YY 




ir J. 


urug composite comprises carrier Douna to drug via spacer userui as 


IT 1 


drug delivery systems — 


XX 




D O 


Claim 1/; Page 41; oipp; Japanese. 


XX 




cc 


The invention provides drug composites comprising a polymer carrier 


cc 


bound through an amino acid spacer to a drug. The drug composite 


cc 


comprises a compound of formula A-R-NH-Y-CH2-0-C0-Q where, A = polymer 


cc 


carrier for a drug; R = spacer comprising 1-8 amino acid molecules bound 


cc 


to each other through a peptide linkage; Y = optionally substituted 


cc 


phenylene; Q = residue of drug. The drug composites are used as drug 


cc 


delivery systems for antitumor or antiinflammatory agents. The composites 


cc 


give rapid and regioselective release of drug thus increasing activity 


cc 


and maximum tolerated dose of drug. The present sequence represents a 


cc 


spacer peptide of the drug composite. 


XX 




SQ 


Sequence 4 AA; 



Query Match 100.0%; Score 24; . DB 21; Length 4; 

Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



* 



Qy 1 GGGF 4 

I I I I 

Db 1 GGGF 4 



RESULT 4 
ABG32178 

ID ABG32178 standard; Peptide; 4 AA. 
XX 

AC ABG3217 8; 
XX 

DT 29-NOV-2002 (first entry) 
XX 

DE Drug complex associated peptide spacer #8. 
XX 

KW Drug complex; spacer; carboxy ( 1-4C) alkyldextran polyalcohol; cancer; 

KW inflammation; antineoplastic; antiinflammatory. 

XX 

OS Synthetic. 
XX 

PN US6436912-B1. 
XX 

PD 20-AUG-2002. 
XX 

PF 25-MAR-1999; 99US-0147342 . 
XX 

PR 06-JUN-1996; 96 JP-0144421 . 

PR 05-JUN-1997; 97WO- JP01914 . 
XX 

PA (DAUC ) DAIICHI PHARM CO LTD. 

PA (DRUG-) DRUG DELIVERY SYSTEM INST LTD. 

XX 

PI Inoue K, Susaki H, Ikeda M, Kuga H, Kumazawa E, Togo A; 
XX 

DR WPI; 2002-673238/72. 
XX 

PT A drug complex used as antineoplastic or antiinflammatory, comprises a 

PT carboxy (1-4C) alkyldextran polyalcohol residue bound to a drug compound 

PT via an amino acid or 2-8 amino acid peptide spacer 
XX 

PS Example 2; Column 15; 41pp; English. 
XX 

CC The invention describes a drug complex comprising a 

CC carboxy (1-4C) alkyldextran polyalcohol and a residue of a drug compound 

CC bound to each other with a spacer comprising an amino acid or 

CC peptide-bonded 2-8 amino acids. The drug complex is used as a medicament 

CC for the treatment of various cancers or inflammation. The drug complex is 

CC capable of site-selectively delivering the antineoplastic or 

CC antiinflammatory to a tumorous site or inflammatory site. The drug can be 

CC retained in the blood for a long period of time and has reduced toxicity. 

CC Since protease is expressed at tumorous and inflammatory sites, the 

CC spacer used here is readily hydrolyzed to allow a released drug compound 

CC to exhibit its efficiency. This sequence represents a peptide spacer used 

CC in the creation of a drug complex for treatment of inflammation and 

CC cancer. 

XX 

SQ Sequence 4 AA; 



Query Match 100.0%; Score 24; DB 23; Length 4; 

Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 

Db 



1 GGGF 4 

I I I I 
1 GGGF 4 



AAR64944; 

2 5-MAR-2 0 03 ( updated ) 
12-SEP-1995 (first entry) 

Peralkylated oligopeptide. 

Peralkylated oligopeptides; binding assays; hydrolysis resistance; 
antimicrobial drug design. 



RESULT 12 
AAR64944 

ID AAR64944 standard; peptide; 6 AA. 
XX 
AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
XX 
PA 
XX 

PI 

XX 
DR 
XX 
PT 
PT 
PT 
PT 
XX 
PS 
XX 



Synthetic . 
Key 

Modified- site 
Modified- site 
Modified- site 



WO9500539-A1. 

05-JAN-1995. 

10-JUN-1994; 

17-JUN-1993;. 
09-JUN-1994; 



Location/Qualifiers 

1 . 

/note= "trimethylammonium N-terminal nitrogen" 
6 

/note= "C-terminal carboxamide group" 
1. . 6 

/note= "A methyl group on each amido nitrogen of 
the backbone and appropriately methylated 
side chains, at total of 8 added methyls." 



94WO-US06554. 

93US-0079144. 
94US-0257782. 



(HOUG-) HOUGHTEN PHARM INC. 

Blondelle S, Houghten R, Ostresh JM, Houghten RA; 
WPI; 1995-052001/07. 

Sets and libraries of per : alkylated oligopeptide derivs . - 
including new cpds . , with selected amino acids at partic. 
positions, used in binding assays to identify sequence with 
optimum binding to e.g. cellular receptors. 

Example 4; Page 105; 162pp; English. 



CC AAR64940-R64959 are peralkylated oligopeptides (POs), which 

CC together comprise a PO library. PO libraries can be used to 

CC assay peptides for preferential binding to an acceptor, esp. for 

CC drug (specifically antimicrobial) design. The POs are reisitant 

CC to enzymatic hydrolysis, also POs that are toxic to gram positive 

CC bacteria do not cause haemolysis of human blood. 

CC (Updated on 25-MAR-2003 to correct PN field.) 

XX 

SQ Sequence 6 AA; 

Query Match 100.0%; Score 24; DB 16; Length 6; 

Best Local Similarity 100.0%; Pred. No. 9.3e+05; 

Matches 4; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 GGGF 4 

I I I I 

Db 2 GGGF 5 



Search completed: October 20, 2003, 17:23:31 
Job time : 44 sees 



